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Preface to the Second Edition 



This second edition contains two additional chapters dealing with binomial, 
multinomial and Poisson models. If you have to analyze data sets where the 
response variable is a count or the distribution of individuals in categories, you 
will be interested in these chapters. Generalized linear models are usually used 
for modeling these data. They assume that the expected response is linked 
to a linear predictor through a one-to-one known transformation. We con- 
sider extensions of these models by taking into account the cases where such 
a linearizing transformation does not exist. We call these models generalized 
nonlinear models. Although they do not fall strictly within the definition of 
nonlinear regression models, the underlying principles and methods are very 
similar. In Chapter 6 we consider binomial variables, and in Chapter 7 multi- 
nomial and Poisson variables. It is fairly straightforward to extend the method 
to other distributions such as exponential distribution or gamma distribution. 

Maintaining the approach of the first edition, we start by presenting prac- 
tical examples, and we describe the statistical problems posed by these ex- 
amples, focusing on those that cannot be analyzed within the framework of 
generalized linear models. We demonstrate how to solve these problems using 
nls2. It should be noted that we do not review the statistical problems re- 
lated to generalized linear models that have been discussed extensively in the 
literature. Rather, we postulate that you have some practical experience with 
data analysis using generalized linear models, and we base our demonstrations 
on the link between the generalized nonlinear model and the lreteroscedastic 
nonlinear regression model dealt with in Chapter 3. For that purpose, the 
estimation method based on the quasi-likelihood equations is introduced in 
Section 3.3. The use of the nls2’s facilities for analyzing data modeled with 
generalized nonlinear models is the main contribution of the second edition. 

The modifications to Chapters 1 to 5 are minor except for the bootstrap 
method. Indeed, we propose an extension of the bootstrap method to lret- 
eroscedastic models in Section 3.4, and we apply it to calculating prediction 
and calibration confidence intervals. 
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XII Preface to the Second Edition 

Let us conclude this preface with the improvements made in nls2. The 
software is now available under Linux and with R 1 as the host system, in 
addition to the Unix/SPlus version. Moreover, a C/Fortran library, called 
nls2C, allows the user to carry out estimation without any host system. And, 
of course, all of the methods discussed in this edition are introduced in the 
software. 



http: / /cran. r-project.org/ 
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Preface to the First Edition 



If you need to analyze a data set using a parametric nonlinear regression 
model, if you are not familiar with statistics and software, and if you make 
do with S-Plus, this book is for you. In each chapter we start by presenting 
practical examples. We then describe the statistical problems posed by these 
examples, and we demonstrate how to solve these problems. Finally, we apply 
the proposed methods to the example data sets. You will not find any math- 
ematical proofs here. Rather, we try, when possible, to explain the solutions 
using intuitive arguments. This is really a cookbook. 

Most of the methods proposed in the book are derived from classical non- 
linear regression theory, but we have also made attempts to provide more 
modern methods that have been proven to perform well in practice. Although 
the theoretical grounds are not developed here, when appropriate we give 
some technical background using a sans serif type style. You can skip these 
passages if you are not interested in this information. 

The first chapter introduces several examples from experiments in agron- 
omy and biochemistry, to which we will return throughout the book. Each 
example illustrates a different problem, and we show how to methodically 
handle these problems using parametric nonlinear regression models. Because 
the term parametric model means that all of the information in the experi- 
ments is assumed to be contained in the parameters occurring in the model, 
we first demonstrate, in Chapter 1, how to estimate the parameters. In Chap- 
ter 2 we describe how to determine the accuracy of the estimators. Chapter 3 
introduces some new examples and presents methods for handling nonlin- 
ear regression models when the variances are heterogeneous with few or no 
replications. In Chapter 4 we demonstrate methods for checking if the assump- 
tions on which the statistical analysis is based are accurate, and we provide 
methods for detecting and correcting any misspecification that might exist. In 
Chapter 5 we describe how to calculate prediction and calibration confidence 
intervals. 

Because good software is necessary for handling nonlinear regression data, 
we provide, at the end of each chapter, a step-by-step description of how to 
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XIV Preface to the First Edition 



treat our examples using nls2 [BH94], the software we have used throughout 
this book. nls2 is a software implemented as an extension of the statistical 
system S-Plus, available at http://www.inra. fr/bia/J/AB/ nls2/, and it offers 
the capability of implementing all of the methods presented in this book. 

Last but not least, we are grateful to Suzie Zweizig for a careful rereading of 
our English. Thanks to her, we hope that you find this book readable! 
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Nonlinear Regression Model and Parameter 
Estimation 



In this chapter we describe five examples of experiments in agronomy and bio- 
chemistry; each illustrates a different problem. Because all of the information 
in the experiments is assumed to be contained in a set of parameters, we will 
first describe, in this chapter, the parametric nonlinear regression model and 
how to estimate the parameters of this model. 1 



1.1 Examples 

1.1.1 Pasture Regrowth: Estimating a Growth Curve 

In biology, the growth curve is of great interest. In this example (treated by 
Ratkowsky [Rat83]), we observe the yield of pasture regrowth versus time 
since the last grazing. The results are reported in Table 1.1 and Figure 1.1. 
The following model is assumed: The yield at time Xi, Yi, is written as Yi = 
f(xi , d)+£i, where f(x , 6) describes the relationship between the yield and the 
time x. The errors e, are centered random variables (E(£j) = 0). Because the 
measurements are in different experimental units, the errors can be assumed to 
be independent. Moreover, to complete the regression model, we will assume 
that the variance of £j exists and equals a 2 . 



Table 1.1. Data of yield of pasture regrowth versus time (the units are not 
mentioned) 



Time after pasture 


9 


14 


21 


28 


42 


57 


63 


70 


79 


Yield 


8.93 


10.8 


18.59 


22.33 


39.35 


56.11 


61.73 


64.62 


67.08 



1 We use this sans serif type style when we give technical background. You can skip 

these passages if you are not interested in this information. 




2 



1 Nonlinear Regression Model and Parameter Estimation 




time 



Figure 1.1. Pasture regrowth example: Observed responses versus time 



The model under study is the following: 

Yi = f(xi,0) + £i ) 

Var(e,) = cr 2 , E (e*) = 0 J ' 

In this example, a possible choice for / is the Weibull model 

f(x, 0) = 0i - 0 2 exp (- exp (0 3 + 0 4 logs)) . 

Depending on the value of 04 , / presents an inflection point or an expo- 
nential increase; see Figure 1.2. 

Generally, the choice of the function / depends on knowing the observed 
biological phenomena. Nevertheless, in this example, Figure 1.1 indicates that 
the response curve may present an inflection point, making the Weibull model 
a good choice. 

Obviously, it is important to choose the function / well. In this chapter 
we will show how to estimate the parameters assuming that / is correct. In 
Chapter 4 we will describe how to validate the choice of /. 

1.1.2 Radioimmunological Assay of Cortisol: Estimating a 
Calibration Curve 

Because the amount of hormone contained in a preparation cannot be mea- 
sured directly, a two-step process is necessary to estimate an unknown dose 
of hormone. First we must establish a calibration curve , and then we invert 
the calibration curve to find the dose of the hormone. 

The calibration curve is estimated by using a radioimmunological assay 
(or RIA). This assay is based on the fact that the hormone H and its marked 



( 1 . 1 ) 

(1.2) 
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Figure 1.2. Growth models described with the Weibull function defined in Equa- 
tion (1.2) 



isotope H* behave similarly with respect to their specific antibody A. Thus, 
for fixed quantities of antibody [A] and radioactive hormone [H*], the quan- 
tity of linked complex [AH*] decreases when the quantity of cold hormone [H] 
increases. This property establishes a calibration curve. For known dilutions 
of a purified hormone, the responses [AH*] are measured in terms of c.p.m. 
(counts per minute) . Because the relation between the quantity of linked com- 
plex [AH*] and the dose of hormone H varies with the experimental conditions, 
the calibration curve is evaluated for each assay. 

Then, for a preparation containing an unknown quantity of the hormone H, 
the response [AH*] is measured. The unknown dose is calculated by inverting 
the calibration curve. 

The data for the calibration curve of an RIA of cortisol are given in Ta- 
ble 1.2 (data from the Laboratoire de Physiologie de la Lactation, INRA). 
Figure 1.3 shows the observed responses as functions of the logarithm of the 
dose, which is the usual transformation for the £-axis. In this experiment, 
the response has been observed for the zero dose and the infinite dose. These 
doses are represented in Figure 1.3 by the values —3 and +2, respectively. 

The model generally used to describe the variations of the response Y ver- 
sus the log-dose x is the Richards function (the generalized logistic function). 
This model depends on five parameters: 02, the upper asymptote, is the re- 
sponse for a null dose of hormone H; 9^, the lower asymptote, is the response 
for a theoretical infinite dose of H; the other three parameters describe the 
shape of the decrease of the curve. The equation of the calibration curve is 

62 - 9 i 

85 ' 



f{x,9) = 9i + 



(1 + exp(0 3 + 9±x)) 



(1.3) 
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Table 1.2. Data for the calibration curve of an RIA of cortisol 



Dose (ng/.l ml) 


Response (c.p.m.) 


0 


2868 


2785 


2849 


2805 


0 


2779 


2588 


2701 


2752 


0.02 


2615 


2651 


2506 


2498 


0.04 


2474 


2573 


2378 


2494 


0.06 


2152 


2307 


2101 


2216 


0.08 


2114 


2052 


2016 


2030 


0.1 


1862 


1935 


1800 


1871 


0.2 


1364 


1412 


1377 


1304 


0.4 


910 


919 


855 


875 


0.6 


702 


701 


689 


696 


0.8 


586 


596 


561 


562 


1 


501 


495 


478 


493 


1.5 


392 


358 


399 


394 


2 


330 


351 


343 


333 


4 


250 


261 


244 


242 


oo 


131 


135 


134 


133 




log-dose 

Figure 1.3. Cortisol example: Observed responses (in c.p.m.) versus the logarithms 
in base 10 of the dose 
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Once we describe the trend of the variations of Y versus x, we also have 
to consider the errors e = Y — f(x, 9). Looking carefully at Figure 1.3, we see 
that the variability of Y depends on the level of the response. For each value 
of x, with the response Y being observed with replications, the model can be 
written in the following manner: Y l3 = f(xi,9 ) + with j varying from 1 
to rii (■ i%i equals 8 or 4) and i from 1 to k ( k = 15). For each value of i the 
empirical variance of the response can be calculated: 



with Y u = Y i Y ij J ' 



(1.4) 



Table 1.3. Cortisol example: Mean and empirical variance of the response for each 
value of the dose 



Dose 


0 


0.02 


0.04 


0.06 


0.08 


Means: Yu 


2765.875 


2567.5 


2479.75 


2194 


2053 


Variances: sf 


6931 


4460 


4821 


5916 


1405 


Dose 


0.1 


0.2 


0.4 


0.6 


0.8 


Means : Yu 


1867 


1364.25 


889.75 


697 


576.25 


Variances: sf 


2288 


1518 


673 


26 


230 


Dose 


1 


1.5 


2 


4 


oo 


Means: Yu 


491.75 


385.75 


339.25 


249.25 


133.25 


Variances: sf 


72 


263 


69 


55 


2 


Table 1.3 gives the values of 


sf and Yu 


. Because the variance of 



sponse is clearly heterogeneous, it will be assumed that Var (e y ) = erf. The 
model is the following: 



Y ij — f{Xi,9) + £ij ) . . 

Var = E(e y ) = 0 J ’ 

where j = 1, . . . rip, i = 1 , k; and the total number of observations equals 
n = f=i n *- f i s defined by Equation (1.3), and the are independent 
centered random variables. In this experiment, however, we can be even more 
precise about the lreteroscedasticity, that is, about the variations of of, be- 
cause the observations are counts that can safely be assumed to be distributed 
as Poisson variables, for which variance equals expectation. A reasonable 
model for the variance could be of = /(xj,0), which will be generalized to 
of = er 2 f(xi,9) T , where r can be known or estimated (see Chapter 3). 

The aim of this experiment is to estimate an unknown dose of hormone 
using the calibration curve. Let p be the expected value of the response for a 
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preparation where the dose of hormone D (or its logarithm X) is unknown; 
then X is the inverse of / calculated in /.t (if /r lies strictly between Q\ and 
0a): 



x = r\ix,e) 



X = 



1 

04 




exp 




0 2 — 0 1 
M - 0i 





( 1 . 6 ) 



This example will be studied in several places in this book. We will discuss 
how to estimate the calibration curve and X in this chapter. The problem of 
calculating the accuracy of the estimation of X will be treated in Chapter 2. 
Methods for choosing the variance function and testing model specifications 
will be proposed in Chapter 4. The calibration problem will be treated in 
detail in Chapter 5. Finally, we show how these data can be fitted using a 
Poisson nonlinear model in Chapter 7. 



1.1.3 Antibodies Anticoronavirus Assayed by an ELISA Test: 
Comparing Several Response Curves 

This experiment uses an ELISA test to detect the presence of anticoronavirus 
antibodies in the serum of calves and cows. It shows how we can estimate 
the parameters of two curves together so that we can ultimately determine 
if the curves are identical up to a horizontal shift. The complete description 
of the experiment is presented in [HLV87]. Here we limit the problem to the 
comparison of antibody levels in two serum samples taken in May and June 
from one cow. 

An ELISA test is a collection of observed optical densities, Y, for different 
serum dilutions, say d. The results are reported in Table 1.4 and Figure 1.4. 



Table 1.4. ELISA example: Observed values of Y for the sera taken in May and 
June. Two values of the response are observed 



dilution 

d 


Response (Optical Density) 


May 


June 


1/30 


1.909 


1.956 


1.886 


1.880 


1/90 


1.856 


1.876 


1.853 


1.870 


1/270 


1.838 


1.841 


1.747 


1.772 


1/810 


1.579 


1.584 


1.424 


1.406 


1/2430 


1.057 


1.072 


0.781 


0.759 


1/7290 


0.566 


0.561 


0.377 


0.376 


1/21869 


0.225 


0.229 


0.153 


0.138 


1/65609 


0.072 


0.114 


0.053 


0.058 



Assuming that the sera are assayed under the same experimental condi- 
tions, the problem is to quantify the differences (in terms of antibody level) 
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Figure 1.4. ELISA example: Observed responses versus the logarithms of the 
dilution, x = — log 10 d 



between the sera using their ELISA response curves. For that purpose, the 
biological assay techniques described in [Fin78] can be used to estimate the 
potency of one serum relative to another serum. The potency p is defined in 
the following manner: One unit of serum taken in May is assumed to produce 
the same response as p units of serum taken in June. This means that the two 
sera must contain the same effective constituent, the antibody, and that all 
other constituents are without effect on the response. Hence one preparation 
behaves as a dilution of the other in an inert diluent. 

Let F May (d) and F June (d) be the response functions for the sera taken in 
May and June, respectively. If the assumptions defined earlier are fulfilled, 
then the two regression functions must be related in the following manner: 
F May (d) = F June (pd). 

The relationship between the optical density Y and the logarithm of the 
dilution x = log 10 (l/d) is usually modeled by a sigmoidal curve: F May (d) = 
f(x,9 May ) and F June (d) = f(x,9 June ), with 



f(x, 9) — 9 1 + 



92 ~ 9 1 

1 + exp0 3 (a: — Q±) ' 



Thus, the estimation of p is based on an assumption of parallelism between 
the two response curves: 



/(M May ) = /0r-/M June ), 



(1.7) 



where (3 = log 10 (l /p). 

In this chapter we will estimate the parameters of the regression functions. 
Obviously, one must also estimate (3. For that purpose, the two curves must 
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be compared to see if they satisfy Equation (1.7). This procedure will be 
demonstrated in Chapter 2. 



1.1.4 Comparison of Immature and Mature Goat Ovocytes: 
Comparing Parameters 

This experiment comparing the responses of immature and mature goat ovo- 
cytes to a hyperosmotic test demonstrates how we can estimate the param- 
eters of two different data sets for the ultimate purpose of determining the 
differences between these sets of parameters. 

Cellular damage that results from exposure to low temperatures can be cir- 
cumvented by the use of permeable organic solvents acting as cryoprotectants. 
Cell water permeability is higher than cell cryoprotectant permeability. Thus, 
when cells are exposed to these solvents, they lose water and shrink in response 
to the variation of osmotic pressure created between the intra- and extracellu- 
lar compartments. Then, as the compound permeates, water reenters the cell, 
and the cell reexpands until the system reaches an osmotic equilibrium. The 
results obtained using immature and ovulated (mature) ovocytes exposed to 
propane-diol, a permeable compound, are presented in [LGR94], 

The cell volume during equilibration is recorded at each time t. In or- 
der to obtain water (P w ) and propane-diol (Ps) permeabilities, the following 
equations are used: 



YM = ±(v w (t) + v s (t) + v x ), 

^ = p V 7i + 72 J_ + 73 M 

d t V 1 V w 7 V w ) 



dK 
d t 



= P, 



^74 + 75 




>• 



(1.8) 



The first equation expresses the cell volume V(t) as a percentage of initial 
isotonic cell volume Vq at any given time t during equilibration; V x is a known 
parameter depending on the stage of the oocyte (mature or immature); V w {t) 
and V s [t) are the volumes of intracellular water and cryoprotectant, respec- 
tively; and the parameters 7 are known. The data are reported in Table 1.5. 

In this example, the variations of Y = V(t)/Vo versus the time t are ap- 
proximated by 

f(t , P w , Ps) = ^r(V w (t) + V s (t) + V x ), 
to 

where V w and V s are the solutions of a system of ordinary differential equations 
depending on P w and P s . 

Our aim here is to estimate the parameters P w and P s for both types of 
ovocytes and to compare their values. We also are interested in the temporal 
evolution of intracellular propane-diol penetration (V s (t)), expressed as the 
fraction of initial isotonic cell volume Vq. 
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Table 1.5. Fraction of volume at time t for two stages (mature and immature) of 
ovocytes 



Time 


Fraction of Cell Volume for Mature Ovocytes | 


0.5 


0.6833 


0.6870 


0.7553 


0.6012 


0.6655 


0.7630 


0.7380 


1 


0.6275 


0.5291 


0.5837 


0.5425 


0.5775 


0.6640 


0.6022 


1.5 


0.6743 


0.5890 


0.5837 


0.5713 


0.6309 


0.6960 


0.6344 


2 


0.7290 


0.6205 


0.5912 


0.5936 


0.6932 


0.7544 


0.6849 


2.5 


0.7479 


0.6532 


0.6064 


0.6164 


0.7132 


0.7717 


0.7200 


3 


0.7672 


0.6870 


0.6376 


0.6884 


0.7494 


0.8070 


0.7569 


4 


0.7865 


0.7219 


0.6456 


0.7747 


0.8064 


0.8342 


0.8100 


5 


0.8265 


0.7580 


0.6782 


0.8205 


0.8554 


0.8527 


0.8424 


7 


0.8897 


0.8346 


0.8238 


0.8680 


0.8955 


0.8621 


0.8490 


10 


0.9250 


0.9000 


0.9622 


0.9477 


0.9321 


0.8809 


0.8725 


12 


0.9480 


0.9290 


1.0000 


0.9700 


0.9362 


0.8905 


0.8930 


15 


0.9820 


0.9550 


1.0000 


1.0000 


0.9784 


0.9100 


0.9242 


20 


1.0000 


0.9800 


1.0000 


1.0000 


1.0000 


0.9592 


0.9779 


Time 


Fraction of Cell Volume for Immature Ovocytes | 


0.5 


0.4536 


0.4690 


0.6622 


0.621 


0.513 






1 


0.4297 


0.4552 


0.5530 


0.370 


0.425 






1.5 


0.4876 


0.4690 


0.5530 


0.400 


0.450 






2 


0.5336 


0.4760 


0.5535 


0.420 


0.475 






2.5 


0.5536 


0.4830 


0.5881 


0.460 


0.500 






3 


0.5618 


0.5047 


0.6097 


0.490 


0.550 






3.5 


0.6065 


0.5269 


0.6319 


0.520 


0.600 






4 


0.6365 


0.5421 


0.6699 


0.550 


0.650 






4.5 


0.6990 


0.5814 


0.6935 


0.570 


0.700 






5 


0.7533 


0.6225 


0.7176 


0.580 


0.750 






6.5 


0.7823 


0.6655 


0.7422 


0.630 


0.790 






8 


0.8477 


0.7105 


0.7674 


0.690 


0.830 






10 


0.8966 


0.8265 


0.8464 


0.750 


0.880 






12 


0.9578 


0.9105 


0.9115 


0.800 


0.950 






15 


1.0000 


0.9658 


0.9404 


0.900 


1.000 






20 


1.0000 


1.0000 


0.9800 


0.941 


1.000 







1.1.5 Isomerization: More than One Independent Variable 

These data, given by Carr [Car60] and extensively treated by Bates and 
Watts [BW88], illustrate how to estimate parameters when there is more than 
one variable. 

The reaction rate of the catalytic isomerization of n-pentane to iso-pentane 
depends on various factors such as the partial pressures of the needed products 
(employed to speed up the reaction) . The differential reaction rate is expressed 
as grams of iso-pentane produced per gram of catalyst per hour, and the 
instantaneous partial pressure of each component is measured. The data are 
reproduced in Table 1.6. 
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Table 1.6. Reaction rate of the catalytic isomerization of n-pentane to iso-pentane 
versus the partial pressures of hydrogen, n-pentane, and iso-pentane 



| Partial Pressure of 


Reaction 

Rate 


Hydrogen 


11 - Pentane 


Iso- Pentane 


205.8 


90.9 


37.1 


3.541 


404.8 


92.9 


36.3 


2.397 


209.7 


174.9 


49.4 


6.694 


401.6 


187.2 


44.9 


4.722 


224.9 


92.7 


116.3 


0.593 


402.6 


102.2 


128.9 


0.268 


212.7 


186.9 


134.4 


2.797 


406.2 


192.6 


134.9 


2.451 


133.3 


140.8 


87.6 


3.196 


470.9 


144.2 


86.9 


2.021 


300.0 


68.3 


81.7 


0.896 


301.6 


214.6 


101.7 


5.084 


297.3 


142.2 


10.5 


5.686 


314.0 


146.7 


157.1 


1.193 


305.7 


142.0 


86.0 


2.648 


300.1 


143.7 


90.2 


3.303 


305.4 


141.1 


87.4 


3.054 


305.2 


141.5 


87.0 


3.302 


300.1 


83.0 


66.4 


1.271 


106.6 


209.6 


33.0 


11.648 


417.2 


83.9 


32.9 


2.002 


251.0 


294.4 


41.5 


9.604 


250.3 


148.0 


14.7 


7.754 


145.1 


291.0 


50.2 


11.59 



A common form of the model is the following: 



f(x,0) 



M 3 (P-//1.632) 

1 + 62H + O3P + O^I 



(1.9) 



where a; is a three-dimensional variate: x = ( H , P, I) where H , P, and / are 
the partial pressures of hydrogen, n-pentane and iso-pentane, respectively. 
In this example we will estimate the 9 ; in Chapter 2 we will calculate the 
confidence intervals. 



1.2 The Parametric Nonlinear Regression Model 

We will use the following notation to define the parametric nonlinear regres- 
sion model: n* replications of the response Y are observed for each value of 
the independent variable xp, let Y ^ , for j = 1 , ...rij, be these observations. 
The total number of observations is n = Xo=i an( l * varies from 1 to k. In 
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Example 1.1.1, n l equals 1 for all values of i, and k = n. In Example 1.1.5, 
the variable x is of dimension three: Xi is a vector of values taken by ( H , P, I). 

It is assumed that the true regression relationship between Y and x is 
the sum of a systematic part, described by a function p(x), and a random 
part. Generally, the function fi(x) is unknown and is approximated by a para- 
metric function /, called the regression function, that depends on unknown 
parameters 9: 

Yj T ■ 

8 is a vector of p parameters 8 \, 82 , ■ ■ ■ 8 p . The function / does not need to be 
known explicitly; in Example 1.1.4, / is a function of the solution of differential 
equations. 

£ is a random error equal, by construction, to the discrepancy between 
Y and f(x,9). Let of be the variance of . The values of of, or their vari- 
ations as functions of Xi, are unknown and must be approximated. In some 
situations the difference in the variances, cr, — cq+i, is small. Thus, there is 
a high confidence in the approximate homogeneity of the variances, and we 
can then assume that Var(e.jj) = a 2 . In other cases, because of the nature of 
the observed response or because of the aspect of the data on a graph, there 
is evidence against the assumption of homogeneous errors. In these cases, the 
true (unknown) variations of of are approximated by a function g called the 
variance function such that Var(£jj) = g(xi, a 2 , 8 ,t). In most cases, g will be 
assumed to depend on /; for example, g(x,cr 2 ,9,T) = cr 2 f(x,d) T , where r 
is a set of parameters that can be assumed to be known or that have to be 
estimated. 

We simplify the necessary technical assumptions by assuming that each 9 a , a = 
1 varies in the interior of an interval. The function / is assumed to be twice 

continuously differentiable with respect to the parameters 8 . 



1.3 Estimation 

Because the problem is to estimate the unknown vector 9, a natural solution 
is to choose the value of 8 that minimizes the distances between the values of 
f(x, 8 ) and the observations of Y . For example, one can choose the value of 8 
that minimizes the sum of squares C{ 8 ) defined by 

c w = EEWi-^’ fl )) 2 - ( L1 °) 

»=1 i=l 

Let 8 be this value. 8 is the least squares estimator of 9. If we assume that 
Var(£jj) = a 2 , an estimate of cr 2 is obtained as 

_ C( 6 ) 



n 



(1.11) 
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Under the assumptions stated in the preceding paragraph, 9 is also the solution of 
the set of p equations: 



Y, 9) ^ ~ f{Xi ’ = 0 > U- 12 ) 



for a = 1, . . .p, where df /dd a is the partial derivative of / with respect to 9 a . 

Because / is nonlinear in 9, no explicit solution can be calculated. Instead an iterative 
procedure is needed. 

The least squares estimator of 9 is also the maximum likelihood estimator in the case 
of Gaussian observations. Recall the main features of the maximum likelihood method. 
Let Y be a random variable distributed as a Gaussian variable with expectation f(x,9) 
and variance a 2 . The probability density of Y at point y is defined as follows: 

l{> • *• "• ^ = Tib “ p (-'"bb" ) ■ 

We observe n = X/!b ni independent Gaussian variables Y tl , j = 1, . . . m, i = 1, . . . k 
with expectation E (Yij) = f(xi,9) and variance Var(Eij) = a 2 . The probability density 
of the observations calculated in yij,j = 1, . . . n;, i = 1, ... At is equal to the following 
formula: 

k nj 
i= 1 3 = 1 

The likelihood is a random variable defined as the probability density calculated in the 
observations: 

k ni 

L(Y n Y knk ■ 9, a 2 ) = R ,Xi,9,a 2 ). 

i = 1 i=i 

The maximum likelihood estimators for 9 and a 2 are those values that maximize the 
likelihood, or equivalently, its logarithm: 

k ni 

V{9,a 2 ) = -^ log(27 ro- 2 ) - ^2 - f( Xi ’ 6 ')) 2 - 

i = 1 3 = 1 



It is clear that maximizing V{9, a 2 ) in 9 is equivalent to minimizing C(9). The maximum 
likelihood estimator of a 2 will satisfy that the derivative of V(9,a 2 ) with respect to a 2 
equals 0. Namely, we get the following: 



dV 



n 



+ 



1 

bb 



C{9) = 0 



leads to Equation (1.11). 

In the case of heterogeneous variances (Var(ejj) = of), it is natural to favor 
observations with small variances by weighting the sum of squares. The af are 
unknown, so they must be replaced by an estimate. If each is big enough, 
say 4, as in Example 1.1.2, then of can be estimated by sf, the empirical 
variance (see Equation (1.4)). The weighted sum of squares is 
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^) = EE 



(Yij - f{xi,0)Y 



i=i i=i 



(1.13) 



and the value of 0 that minimizes W(9) is the weighted least squares estimator 
of 6. The problem of estimating the parameters in lreteroscedastic models is 
treated more generally in Chapter 3. 



1.4 Applications 

1.4.1 Pasture Regrowth: Parameter Estimation and Graph of 
Observed and Adjusted Response Values 

Model The regression function is 

f{x, 0) = 0i - 9 2 exp (- exp (0 3 + 0 4 log x)) , 

and the variances are homogeneous: Var(£j) = a 2 . 

Method The parameters are estimated by minimizing the sum of squares C(0); 
see Equation (1.10). 

Results 



Parameters 


Estimated Values 


9i 


69.95 


02 


61.68 


9 3 


-9.209 


9i 


2.378 



The adjusted response curve , f(x,9 ), is shown in Figure 1.5. 

It is defined by the following equation: 

f(x , 6) = 69.95 — 61.68 exp (— exp(— 9.209 + 2.378 log x)) . 



1.4.2 Cortisol Assay: Parameter Estimation and Graph of 
Observed and Adjusted Response Values 

Model The regression function is 



/(*, 9 ) = 0i + T , 

(1 + exp(0 3 + 9ax)) 5 

and the variances are heterogeneous: Var(£j) = af. 

Method The parameters are estimated by minimizing the weighted sum of 
squares W(9)\ see Equation (1.13). 




